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We investigate a model of evolutionary dynamics on a smooth landscape which features a "mutator" 
allele whose effect is to increase the mutation rate. We show that the expected proportion of 
mutators far from equilibrium, when the fitness is steadily increasing in time, is governed solely by 
the transition rates into and out of the mutator state. This results is a much faster rate of fitness 
increase than would be the case without the mutator allele. Near the fitness equilibrium, however, 
the mutators are severely suppressed, due to the detrimental effects of a large mutation rate near 
the fitness maximum. We discuss the results of a recent experiment on natural selection of E. coli 
in the light of our model. 



In a recent paper Q, Sniegowski et. al. presented re- 
sults concerning the mutation rate of a series of E. coli 
populations undergoing natural selection in a laboratory 
setting. They found that three out of twelve populations 
had mutated to states with much higher mutation rates, 
becoming, in the language of population genetics, "muta- 
tors" . These changes were traced to the disabling of some 
specific DNA repair mechanisms Q| in each of the three 
mutators. While similar results had been obtained in 
chemostats utilizing artificially hobbled bacterial strains 
P,B this is the first time that natural populations had 
shown selection for more rapid mutation. These results 
have important implications for a variety of topics, in- 
cluding the debate over "directed mutation" || and the 
accumulation of multiple mutations in cancer cells 

Models of mutator selection have in general taken one 
of two forms. On one hand, Painter M and others || have 
considered the case of a small number of genetic " states" 
and used mean-field theory to predict mutator popula- 
tion. In the simplest realization, a bacterium can be in 
one of four states, mutator/non- mutator and high/low 
fitness. While this approach may be suitable for the 
aforementioned chemostat experiments, it is inadequate 
to describe the slow overall improvement in fitness seen in 
long-term natural evolution ]^] . At the opposite extreme, 
Taddei (h]] et. al. have produced a complex model by 
combining measured data on E. coli with other more arbi- 
trary assumptions. This model, while in principle useful, 
does not easily lend itself to analysis and the identifica- 
tion of which details are essential and which not. 

In this paper, we propose a simple approach to muta- 
tor selection based on the fitness landscape being smooth. 
In previous work Jl^Jl^] , we and our collaborators have 
studied a simple model of evolution in a smooth land- 
scape. This model has shown itself very amenable to ana- 
lytic treatment and in addition capable of explaining var- 
ious experimental data on the evolution of RNA viruses 
JTT| . By extending the model to include the possibility 
of a mutator state, we will show that the basic features 



of the experimental data can be understood, provided 
that the transition rate from the "mutator" state to the 
normal, non-mutator state is much slower than the time- 
scale of the experiment. The model then predicts that 
the system at the time of measurement is in fact near a 
crossover between a far-from-equilibrium state where the 
system is "climbing" the fitness hill and the equilibrium 
state; this is consistent with the measured fitness data 
|^| . The model allows us to make a number of other pre- 
dictions which can hopefully be tested by future results 
of the ongoing experiment. 

Our basic model assigns a "fitness", i.e. reproduction 
rate, to each individual in a population. This fitness is 
determined by its genome, which consists of L binary 
genes, with values and 1, and is simply the number of 
l's in the genome. Reproduction for each individual is 
modeled as a Poisson process whereby a new individual 
with the same fitness (modulo mutation( as the parent is 
added to the population. At the same time as this asexual 
reproduction event occurs, one of the existing members 
of the population is chosen at random and "killed" , so as 
to maintain a fixed population size N. (The fact that in 
the experiment growth is allowed to take place for some 
time before the population is culled to its original size, 
resulting in a population that oscillates between two val- 
ues, is an inessential complication for our considerations.) 
Mutation may accompany birth so that, with probability 
/j,, one of the genes of the baby is chosen at random and 
flipped. The usual assumption in the population genetics 
literature that most mutations are deleterious arises here 
as a consequence of a population lying close to the fitness 
peak, so that most genes have the value 1. This creates 
an imbalance in the overall probability of moving up or 
down the landscape via mutation. The analysis of this 
model and its application to the RNA virus experiments 
has been given elsewhere Jl|Jl|. 

To apply this framework to the E. coli experiments, we 
add one special two-allele gene which controls the muta- 
tion rate. With the mutator allele, the mutation rate 
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is increased from \i to Xfj, without any direct effect on 
fitness. We take 07, (a b ) to be the forward (backward) 
mutation rate to (from) the mutator state. 

We first analyze the equilibrium state of the popula- 
tion. Assuming N ^> L, we can use mean- field theory 
and ignore any fluctuations in the population. This leads 
immediately to the equations 

= P x = (x - x)P x + fJ,x(j) x P x -l + (1 - Px)Px+l - Px) 

- a f xP x + a b xQ x 
= Q x = (x - x)Q x + \iix(p x Q x -i + (1 - p x )Q x+ i - Q x ) 
+ GjxP x - cr b xQ x (1) 

Here, p x is the probability that a mutation results in 
moving up in fitness, which equals 1—x/L. P x and Q x are 
respectively the normal and mutator population fractions 
at fitness x, and x is the mean fitness of the population, 
considering both normal and mutator types on an equal 
footing. To solve this equation, we assume that L is large 
and define y = L — x. To leading order in L, 

= (y - y)P y + lxL(-P y + P y _ x ) - a f LP v + a b LQ y (2) 
= {y - y)Qy + \^L(-Q y + Qy-i) + o s LPy - a b LQ y (3) 

This equation also applies for y = if we set P_j = 
Q-i =0. At y = 0, we obtain 



Qo — Po 
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a b L + A/iL — y 
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as well as a quadratic equation for y. While this equation 
is messy, it is easy to check that if A = f, then y = 
/iL, giving a b Qo — 07 Po as expected by naive balance 
between the forward and backward transitions. On the 
other hand, if A is large, y — (/i + trj)L; this is easily seen 
since Qo ~ (er//A/i)Po is then small and can be dropped 
from the Po equation. 

To make further progress, we let A be large. Then, to 
leading order, the P subpopulation decouples and 



Py=Po 



r(y + i) 



(5) 



where Po is fixed by normalization to Po = e~' lL . This is 
precisely the result in the absence of the mutator state. 
The Q distribution follows from the inhomogeneous re- 
cursion relation, eq. (2), with the sources fixed by the 
known P y . Solving this equation, we see that the relative 
narrowness of the P distribution essentially "collapses" 
all the source terms to y = 0, giving the simple result for 
f < y < A, 
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(A/xL) y r(A/xL + f ) 



r(A/xL + y + l) 
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Thus, even though each individual Q y is of order trj/A/i, 
the width of the q distribution is large, of order ^JX[iL, 



so the total number of fast individuals is much larger, 
a f Jl^. One can further check this calculation by com- 
puting the value of y from the P and Q distributions. The 
mean y of the normal type is just /iL, but the small num- 
ber of fast type have anomalously high y and contribute 
a total of <7fL to the total fitness without changing the 
total number of individuals significantly, reproducing the 
result obtained above. 

The reason for the suppression of the fast type is clear, 
and has been appreciated for a long time 14 1 . The "nat- 
ural" equilibrium of the fast type, if there were no slow 
type around, would place them A times as far from the 
fitness peak than the slow type. Thus, the faster muta- 
tion rate of the fast species places a "mutational load" 
on them, so they lose out in competition with the nor- 
mal species and are suppressed. The noteworthy part of 
the calculation is the prediction size of the suppression, 
which would be difficult to guess a priori. 

We now turn to the far-from-equilibrium case where 
the typical fitness of an individual is close to L/2 on the 
scale of L. Here, mutations will produce movement up or 
down the landscape with equal probability. In this case, 
with just the normal type present, we found [fl3f that the 
population increases its mean fitness at a constant rate 
on average. In this regime, mean-field theory is an utter 
failure, leading jl^,|l3| to finite-time singularities caused 
by the inability to properly account for the essential vari- 
ance limiting property of birth-death processes |l5| . We 
turn instead to an approach |]l3f which involves truncat- 
ing the state space of the exact Markov process, utilizing 
the assumption that the mutation probabilities /1, cr/ and 
<j b are much smaller than unity. To see the structure of 
the problem, we focus first on the case where the pop- 
ulation consists of only two individuals, i.e. N = 2. In 
this case, the states we need to consider are those with 
the two individuals having the same fitness x, either both 
normal, (fll) x , both mutator, (f22) x , or one normal and 
one mutator, (fl2) x . In addition, we need to consider the 
states where the two individuals have adjacent fitnesses 
at x and x + 1, with both being either normal, (gll) x , 
or mutator, (<?22) x . The states (gl2) x and (g21) x where 
the two individuals having different fitness and type are 
of order fiaj and are therefore dropped along with the 
states where the two individuals differ in fitness by more 
than 1, which are higher order in /1. Truncating the mas- 
ter equation to these states yields 

{fll) x = -2»x(fll) x + ^x(gll) x + ^( 5 11) X _! 

- 4a f x(fll) x + h(fl2) x (7) 

and similar equations for the other 4 densities. In the 
case with just normal individuals, we found |ll| that the 
long-time solution had the scaling form f x = ^F(x/t), 
which indicates directly the linear growth of fitness with 
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time. Examining our system of equations, we see that 
such a scaling form is not possible unless 

a b f '(22) = 07/(11) + (A22)(x/t)/(xt 2 ) (8) 
/(12) = 8^/(11) + (Al2)(x/t)/(xt 2 ) (9) 

The physical meaning of these relationships is that the 
system must quickly evolve to a state of "local equilib- 
rium" between the normal and fast types before it can 
achieve the linear-velocity climbing state. In this "local 
equilibrium" , the ratio of fast to normal types is given 
by 2(/22) + /12)/2((/ll) + /12 + /22) = a f /a b , just 
as it would if there were no fitness degrees of freedom. 
Substituting this scaling form into the truncated master 
equations and dropping all time derivatives of the g's and 
of /12 (they are second order quantities), we get, after 
eliminating the A terms 



(i + ^)/h = (i 



^i){| [(/ll)' + x(fll)"]} (10) 



This equation shows that the velocity has been renor- 
malized by the factor (a f X + at) / (c f +<Jb), which is very 
reasonable given the ratio of mutator to normal types and 
the fact that the velocity is exactly linear in the mutation 
rate and the number of individuals. 

The above methodology can be extended to arbitrarily 
large ./V with the basic results concerning the percent- 
age of mutators and the renormalization of the velocity 
unchanged. Basically, they follow immediately from the 
above scaling structure of the long-time solution. In Fig. 
1, we show the prediction of this analysis along with di- 
rect simulations of our Markov process. The agreement 
of the theory with the asymptotic state is quite satis- 
factory. There is however a transient period in which 
a larger number of mutators are selected. This latter 
behavior is due to the fact that evolutionary advance 
of the initial population (consisting entirely of wild-type 
cells) occurs most readily by creating mutators; eventu- 
ally, these mutators start making back transitions and 
establish the steady-state ratio. This transient response 
is analogous to what has been termed "hitch-hiking" in 
the genetics literature. We see that this naive picture 
of the selection of mutators in the climbing state breaks 
down rather quickly and the true situation is one of bal- 
ance based solely on the forward and backward rates. 

In a given experiment, the population is placed in a 
new environment and allowed to evolve towards the fit- 
ness peak. Initially, the rate of fitness improvement will 
be correlated with the mutator percentage in any given 
realization, with the average effect as given above. Even- 
tually, the system will begin to approach equilibrium and 
the mutators will start to be at a disadvantage. Roughly, 
the crossover should take place when the average fitness 
equals the equilibrium mean fitness of the mutator type 
considered alone, namely a distance A/iL from the peak. 
After this point, it is advantageous for the population to 



shed its mutators and continue to climb to its final rest- 
ing point, a distance (/i + <Jf)L from the top. In essence, 
then, the system gets a "free- ride" from the presence of 
the mutators. They help the system to get close to the 
fitness peak, but then politely bow out of the picture so 
as not to significantly impair the final mean fitness. 
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FIG. 1. Percentage of mutators in the far-from-equilibrium 
case, with 07 = 0.001, Ob = 0.004, together with the asymp- 
totic prediction, fi = 0.01, A = 10, N = 100, averaged over 
400 realizations. 

We now turn to a discussion of the results of the exper- 
iment detailed in Ref. Jl| . This work reported that three 
out of twelve populations fixed completely the mutator 
genotype; yet there was no clear correlation between fit- 
ness and mutator status at the current epoch. While 
it is possible that fluctuations are dominating the aver- 
age behavior, we will assume the experimental results 
reflect the typical behavior of the system and ask how 
they relate to our model. It is clear that the only way 
to make our model correspond to the experimental find- 
ings is to take the backward mutation rate a b to be so 
small as to play no role on the time-scale of the experi- 
ment. Otherwise, one would have expected to see rever- 
sions from the mutator state to the normal state, which 
were not observed. Biologically, this vanishingly small 
(Th corresponds to the mutation of the DNA repair mech- 
anism being due to something other than a (reversible) 
point mutation. This assumption of no back-mutations 
implies that it is possible to completely fix the muta- 
tor gene, the percentage of systems doing so increasing 
monotonically with time. The rate of fixation becomes 
vanishingly small as the mean fitness in the non-fixed 
systems approaches the equilibrium mutator fitness. Af- 
ter this crossover point, the mutators become suppressed 
and fixation becomes extremely unlikely. This behavior 
is indicated in Fig. 2, in which a set of 40 populations 
experienced nine instances of mutator fixation. The fact 
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that no additional fixations were seen in the last third of 
the experiment indicates that the system was near this 
crossover point at the end of the run. This fact is consis- 
tent with the leveling off of the rate of fitness improve- 
ment in the experiment, as report in Ref. || . Near 
the crossover point, there is no clear correlation between 
mutator status and fitness, as can be verified from our 
simulations (see Fig. 3). This lack of correlation is also 
consistent with the experimental findings. 
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FIG. 2. Fraction of populations which have not completely 
fixed the mutator gene, for a simulation with 40 indepen- 
dent realizations of our stochastic evolution model, fi = .005, 
a s = .0005, a b = 0, A = 15, L = 100, N = 1000. After the 
times shown, no more populations became mutator-fixed. 

If our understanding proves correct, we can make the 
following predictions for what should be seen as the ex- 
periment is continued past the times reported. Since ab 
is small, no reversions should be seen. Since we are near 
the crossover, we expect no additional mutator fixations. 
More significantly, the correlation between fitness and 
mutator status should have been positive in the past and 
should become negative in the future. This can be tested 
by studying isolates from fixed times in the past and 
by repeating the measurements as the evolution contin- 
ues. Also, we predict that the percentage of mutation 
fixations should depend on the initial distance from the 
fitness peak; that is, how much the experimental growth 
conditions depart from the usual wild- type habitat. 

To summarize, we have presented a simple model 
which describes how the dynamics of mutation rate 
changes affects and is affected by natural selection. The 
analytic tractability of our evolutionary dynamics allows 
us to focus directly on the issues which determine selec- 
tion for or against mutator states. Our various predic- 
tions for the experiment should test whether this type 
of generic analysis is powerful enough to make reliable 
statements regarding these complex biological processes. 



90 



85 - 



C/) 

(/) 

CD 



80 - 



75 



70 



fixed 

non-fixed 




J. J y 
/ / 

1 / 








/ s 

/V 

i.i. 





10 



20 
Time 



30 



40 



FIG. 3. Fitness vs. time, for the simulations described in 
Figure 2, plotted separately the average fitness over the subset 
of our populations which fix (9 out of 40) or do not fix (31 
out of 40) the mutator gene sometime during the run. 
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